HOW LONG CAN IT TAKE FOR A QUANTUM CHANNEL TO FORGET EVERYTHING?



ANDRE AHLBRECHT, FLORIAN RICHTER, AND REINHARD F. WERNER 



ABSTRACT. We investigate quantum channels which, after a finite number k of repeated
applications, erase all input information, i.e., channels whose k-th power (but no smaller
power) is a completely depolarizing channel. We show that on a system with Hilbert space
dimension d, the order is bounded by $k \le d^2 - 1$, and give an explicit construction scheme
for such channels. We also consider strictly forgetful memory channels, i.e., channels
with an additional input and output in every step, which after exactly k steps retain no
information about the initial memory state. We establish an explicit representation for
such channels showing that the same bound applies for the memory depth k in terms of the
memory dimension d.

1. Introduction 

Quantum channels are the mathematical description for the most general quantum in- 
formation processing operations. In this paper we consider the question how quantum 
information can be erased in an iterated process. In the simplest case, all information is 
lost after a single step of this process, i.e. the quantum channel completely depolarizes its 
initial state. Of course, the more interesting case is when the channel representing the one- 
step process acts non-trivially on the input system, but after a finite number of iterations 
leaves no information about the input system's initialization. We refer to such a channel as 
a root of a completely depolarizing channel (see Figure 1). One of the main objectives in
this article is to show how long it can possibly take until such an iteration is completely de- 
polarizing. An upper bound in terms of the system's dimension can be derived easily from 
the Jordan normal form of the channel, but since the process needs to represent a physical 
transformation, which is expressed by complete positivity of the corresponding map, it is 
not clear a priori whether this bound is attained by some channel. In order to show that this 
is indeed a tight bound, we develop an explicit construction scheme for maximal roots of 
completely depolarizing channels. 

One motivation to look at this problem stems from quantum memory channels [KW05].
These are channels which account for correlations between successive uses of the chan- 
nel by introducing an additional system, referred to as the memory. A central question 
in this context is whether the influence of a fixed input on the memory dies out in time, 
i.e. whether the channel is forgetful or not [KW05, RZ09]. Moreover, if the impact of the
memory input vanishes within a finite number of steps the channel is referred to as strictly 
forgetful. We will demonstrate a connection between the concept of roots of completely 
depolarizing channels and strictly forgetful memory channels. The idea is to consider the 
transformation of the memory as a function of the state of the external input system. When
the memory channel is strictly forgetful, this must be a root of a completely depolarizing
channel for all system states. The converse, however, is not true: There are memory chan- 
nels which give a root of a completely depolarizing channel for all fixed inputs, but are not 
strictly forgetful for general sequences of possibly entangled input states. Nevertheless, by 
adapting our method to the setting of strictly forgetful memory channels it is possible to 
create a technique which yields all strictly forgetful memory channels. 







Figure 1. The channel S is a finite root of the completely depolarizing
channel (CDC), since a finite number of iterations maps an arbitrary input
to the maximally mixed state (d denotes the dimension of the quantum
system).



As a second application of our theory, we discuss the generation of finitely correlated
spin-chain states [FNW92] by a root of completely depolarizing channels. If the channel
is a k-th root, one obtains so-called k-dependent states [Pet90, Mat98], which are defined
by the property that the output observables separated by more than k sites are independent.

Our paper is organized as follows. We first set up some notation and background on 
quantum channels on finite dimensional systems. In Section 3 we derive the general upper
bound, and then describe the construction of maximal roots. For qubits we give an exhaustive
construction, and for general systems we focus on the question of how the Jordan structure
of a maximal root can be realized by completely positive maps. In Section 4 we show
how to obtain k-dependent states, and in Section 5 we discuss the connection to memory
channels.



2. Quantum channels on finite dimensional systems 

The purpose of this section is to introduce notation and to give some necessary back- 
ground on the mathematical aspects of quantum channels. For a detailed introduction to 
this topic we refer the reader to Paulsen's book [Pau03].

Throughout this paper we deal exclusively with quantum systems which can be described
by a finite dimensional Hilbert space $\mathcal{H} = \mathbb{C}^d$. By $\mathcal{M}_d$ we denote the set of linear
operators on $\mathbb{C}^d$, and the physical states of the system are represented by the set of density
operators $\mathcal{S}(\mathbb{C}^d) := \{\rho \in \mathcal{M}_d,\ \rho \ge 0,\ \mathrm{tr}(\rho) = 1\}$. Possible measurements on the system
are associated with the set of hermitian operators $\mathfrak{M}(\mathbb{C}^d) := \{A \in \mathcal{M}_d,\ A^* = A\}$, where
$A^*$ denotes the adjoint of $A \in \mathcal{M}_d$. A quantum channel can be defined in two different
ways: we can either regard it as a transformation of the physical states or as a transformation
of the measurements. The first point of view, also known as the Schrödinger picture,
corresponds to a linear mapping $T_*$ from the states on an input system $\mathcal{H}_{in}$ to states of an
output system $\mathcal{H}_{out}$, that is,

(1) $T_* : \mathcal{S}(\mathcal{H}_{in}) \to \mathcal{S}(\mathcal{H}_{out})$.

In the Heisenberg picture, which is precisely the second point of view, the quantum channel
is represented by a linear map $T$ from the measurements on $\mathcal{H}_{out}$ to measurements on $\mathcal{H}_{in}$,
i.e.,

(2) $T : \mathfrak{M}(\mathcal{H}_{out}) \to \mathfrak{M}(\mathcal{H}_{in})$.

The maps $T$ and $T_*$ are equivalent representations of the same physical transformation iff
all expectation values for measurements $A$ performed on states $\rho$ coincide after application
of $T$ and $T_*$, respectively. Hence, the representations in Heisenberg and Schrödinger picture
are connected by the duality relation

(3) $\mathrm{tr}(T_*(\rho) A) = \mathrm{tr}(\rho\, T(A))$.

By linearity both maps extend to the whole space $\mathcal{M}_{d_{in}}$, respectively $\mathcal{M}_{d_{out}}$. In order to
represent physical transformations, the maps $T_*$ and $T$ have to satisfy certain properties.
Both have to be completely positive, that is, the extended maps $T_* \otimes \mathrm{id}_n$ and $T \otimes \mathrm{id}_n$,
where $\mathrm{id}_n$ denotes the identity map on n-dimensional matrices, preserve positivity of
operators for every n. Additionally, it is often assumed that the quantum channel always generates an
output when it is fed with a state of the input system. Mathematically, this is expressed
by the assumption that $T_*$ is trace-preserving and $T$ is unital. Note that one property is a
consequence of the other and the duality relation.

A channel which maps every input state to the same output state $\sigma$ is called a completely
depolarizing channel (CDC). We denote the Schrödinger picture representation of
this channel by $T_{\sigma *}$; its mathematical definition reads $T_{\sigma *}(\rho) = \mathrm{tr}(\rho)\,\sigma$, with $\sigma \in \mathcal{S}(\mathcal{H}_{out})$.
The duality relation yields

(4) $\mathrm{tr}(T_{\sigma *}(\rho) A) = \mathrm{tr}(\rho)\,\mathrm{tr}(\sigma A) = \mathrm{tr}(\rho\, \mathrm{tr}(\sigma A) \cdot 1)$,

thus, $T_\sigma(A) = \mathrm{tr}(\sigma A) \cdot 1$ for every $A \in \mathcal{M}_{d_{out}}$ is the representation of a CDC in the Heisenberg
picture. The particular case where $d_{in} = d_{out} = d$ and $\sigma = \frac{1}{d} 1$ leads to the CDC
which is defined by $T_{1/d\,*}(\rho) = \mathrm{tr}(\rho)\,\frac{1}{d} 1$ and $T_{1/d}(A) = \mathrm{tr}(A)\,\frac{1}{d} 1$. We will refer to this
channel as the bistochastic CDC.

The mathematical theory of completely positive maps provides some important results 
leading us to different ways of specifying a quantum channel. Since we are interested 
in concatenable channels, we focus our attention in the following to channels with equal 
input and output system. The first statement is the famous theorem of Kraus [Kra83],
which proves that every completely positive map T admits a decomposition of the form

(5) $T(X) = \sum_\alpha K_\alpha^* X K_\alpha$ with $\sum_\alpha K_\alpha^* K_\alpha = 1$.

We refer to $\{K_\alpha\}$ as Kraus operators of the channel T. Note that this representation involves
a unitary degree of freedom, i.e., the channels defined by $\{K_i\}$ and $\{\tilde K_i\}$ coincide if
there exists a unitary U such that $\tilde K_i = \sum_j U_{ij} K_j$ holds. If we restrict to Kraus decompositions
with minimal numbers of Kraus operators, this is actually the only freedom we have
in choosing the $K_i$. In other words, two minimal Kraus representations $\{K_i\}$ and $\{\tilde K_i\}$ are
always connected by a unitary U and the formula $\tilde K_i = \sum_j U_{ij} K_j$. Of course, two Kraus
decompositions of a quantum channel T do not necessarily consist of the same number of
Kraus operators. For example, consider the convex combination $T = \lambda T_1 + (1 - \lambda) T_2$
of two channels $T_1$ and $T_2$ with Kraus operators $\{K_{1,i}\}$ and $\{K_{2,i}\}$, respectively. Clearly, T
can be written as a Kraus decomposition with operators $\{\sqrt{\lambda} K_{1,i}\} \cup \{\sqrt{1-\lambda} K_{2,i}\}$, but in
general there exists a Kraus decomposition with fewer operators. We define the minimal
number of Kraus operators as the Kraus rank of the channel T and note that a Kraus decomposition
is minimal iff the operators $K_i$ are linearly independent. We will see in the
next section that a possible Kraus decomposition of the bistochastic CDC in the case of a
qubit input and output system is $T_{1/2}(X) = \frac{1}{4} \sum_{i=0}^{3} \sigma_i X \sigma_i$ with Pauli operators $\sigma_i$. In
fact, this result can trivially be extended to higher dimensions by replacing Pauli operators
by Weyl operators. This implies that the Kraus rank of the bistochastic CDC is always $d^2$,
where d is the system's dimension.
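The qubit Kraus decomposition of the bistochastic CDC quoted above can be checked numerically. The following is a minimal sketch in plain Python; the helper names `mat_mul` and `cdc_via_paulis` and the test operator `x` are illustrative choices and do not come from the paper. It evaluates (1/4) sum_i sigma_i X sigma_i and compares the result with tr(X)/2 times the identity.

```python
# Pauli operators sigma_0 .. sigma_3 as nested lists.
PAULIS = [
    [[1, 0], [0, 1]],
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]

def mat_mul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def cdc_via_paulis(x):
    """Evaluate (1/4) * sum_i sigma_i X sigma_i for a 2x2 matrix X."""
    out = [[0j, 0j], [0j, 0j]]
    for s in PAULIS:
        sxs = mat_mul(mat_mul(s, x), s)
        for r in range(2):
            for c in range(2):
                out[r][c] += sxs[r][c] / 4
    return out

x = [[1 + 2j, 3], [4j, 5]]           # arbitrary test operator (hypothetical)
result = cdc_via_paulis(x)
expected = (x[0][0] + x[1][1]) / 2   # tr(X)/2 on the diagonal, zero elsewhere
```

The off-diagonal entries of `result` vanish and both diagonal entries equal `expected`, in line with T(X) = tr(X) (1/2) 1 for d = 2.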

Another characterization arises if we consider Choi's theorem [Cho75]. The statement
of the theorem is sometimes called the channel-state duality, since it states that a map
T is completely positive iff its corresponding Choi operator

$\mathcal{C}_T := T \otimes \mathrm{id}(|\Omega\rangle\langle\Omega|) = \frac{1}{d} \sum_{ij} T(|i\rangle\langle j|) \otimes |i\rangle\langle j|$, with $|\Omega\rangle = \frac{1}{\sqrt{d}} \sum_i |ii\rangle$,

is positive. In fact, the trace of $\mathcal{C}_{T_*}$ is normalized; hence, T is completely positive iff $\mathcal{C}_{T_*}$ is a state. We will refer to $\mathcal{C}_{T_*}$ as the
Choi state of T. Since the relation between a channel T and its corresponding Choi state
$\mathcal{C}_{T_*}$ is invertible, any state fully determines a channel and vice versa. The Choi state and
Choi operator of a CDC are then given by

(6) $\mathcal{C}_{T_{\sigma *}} = \sigma \otimes \frac{1}{d} 1_d$ and $\mathcal{C}_{T_\sigma} = \frac{1}{d} 1_d \otimes \sigma^T$,

where $\sigma^T$ denotes the transpose of $\sigma$. Furthermore, the linearity of a channel T allows for a
representation of T by a matrix $D_T$. For that reason we equip the vector space $\mathcal{M}_d$ with
the Hilbert-Schmidt scalar product $\langle A|B\rangle_{HS} := \mathrm{tr}(A^* B)$ and define the representation
matrix of a channel as $(D_T)_{i,j} := \langle A^i | T(A_j) \rangle_{HS}$ with $\{A_1, \dots, A_{d^2}\}$ as operator basis and
$\{A^1, \dots, A^{d^2}\}$ its dual basis defined via $\mathrm{tr}(A^{i*} A_j) = \delta_{ij}$. We point out that usually there
is no way to determine the complete positivity of a map solely from its representation
matrix without any knowledge of the operator basis. However, if we choose the matrix
units $E_{ij} := |i\rangle\langle j|$ with $i, j \in \{1, \dots, d\}$ as a basis for the representation we find that
the representation matrix of T and its corresponding Choi operator $\mathcal{C}_T$ are connected via
$(D_T)_{nm,kl} = \langle E_{nm} | T(E_{kl}) \rangle = d\, \langle n \otimes k | \mathcal{C}_T | m \otimes l \rangle$. Thus, the representation matrix of a
CDC is given in this basis by

(7) $\langle E_{nm} | T_\sigma(E_{kl}) \rangle = \mathrm{tr}(E_{mn}\, \mathrm{tr}(\sigma E_{kl})\, 1) = \delta_{n,m} \langle l | \sigma | k \rangle$.
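As a sanity check of the matrix-unit representation of a CDC, the entries can be computed directly from the Hilbert-Schmidt scalar product. The sketch below is illustrative only: the density operator `SIGMA` and all function names are hypothetical choices, and indices run from 0 instead of 1.

```python
D = 2
SIGMA = [[0.7, 0.1 + 0.2j], [0.1 - 0.2j, 0.3]]   # hypothetical density operator

def mat_mul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def dagger(a):
    """Adjoint (conjugate transpose) of a square matrix."""
    n = len(a)
    return [[a[j][i].conjugate() for j in range(n)] for i in range(n)]

def trace(a):
    return sum(a[i][i] for i in range(len(a)))

def unit(i, j):
    """Matrix unit E_ij = |i><j|."""
    return [[1 if (r, c) == (i, j) else 0 for c in range(D)] for r in range(D)]

def cdc_heisenberg(a):
    """Heisenberg-picture CDC: T_sigma(A) = tr(sigma A) * 1."""
    t = trace(mat_mul(SIGMA, a))
    return [[t if r == c else 0 for c in range(D)] for r in range(D)]

def rep_entry(n, m, k, l):
    """<E_nm | T_sigma(E_kl)> with the scalar product tr(A^* B)."""
    return trace(mat_mul(dagger(unit(n, m)), cdc_heisenberg(unit(k, l))))
```

Every entry `rep_entry(n, m, k, l)` should agree with delta_{n,m} times `SIGMA[l][k]`, which is the content of equation (7).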

The divisibility of quantum channels, that is, the property of a channel T to be decomposable
into two non-trivial channels $S_1$ and $S_2$, has been investigated in reference [WC08].
In the paper at hand, we specialize this investigation to the divisibility of a completely
depolarizing channel $T_\sigma$ into a self-concatenation of identical maps, i.e. whether there exists
a channel S and $k \in \mathbb{N}$ such that $S^k = T_\sigma$. Finally, we draw some connections of this
problem to other fields in quantum information theory.

We close this section with a mathematical definition of a k-th order root of a quantum
channel. According to Figure 1 we define:

Definition 2.1 (Root of a Channel). A k-th root of the channel $T : \mathcal{M}_d \to \mathcal{M}_d$ is a channel
$S : \mathcal{M}_d \to \mathcal{M}_d$ with

(8) $S^k = T$ and $S^r \neq T$ for $r < k$, $k, r \in \mathbb{N}$.
We refer to k as the order of a root.

3. Roots of Completely Depolarizing Channels 

We start this section with some general comments about the construction of roots of a
CDC. Our aim is to get roots with maximal number of necessary self-concatenations, i.e.
maximal order roots. It will turn out that this maximal order is always $d^2 - 1$, where d
denotes the dimension of the underlying Hilbert space $\mathcal{H}$. If we consider the bistochastic
CDC it is always possible to construct a maximal order root of this channel. After
characterizing all maximal roots of the bistochastic CDC for qubit systems we present a
construction scheme leading to maximal roots of the bistochastic CDC in arbitrary dimensions.

3.1. General upper bound. The aim of this section is to describe the general approach 
we take to construct roots of a CDC. In particular, we investigate what the highest possible 
order of a finite CDC-root in terms of the dimension of $\mathcal{H}$ can be. Comparing the three
introduced representations it turns out that the matrix-representation of a channel is the 
most fruitful one to determine an upper bound for the maximal order of a root. 

Theorem 3.1 (Boundedness of the root order). Let $S : \mathcal{M}_d \to \mathcal{M}_d$ be a channel which is
a k-th root of a completely depolarizing channel $T_\sigma$. Then $k \le d^2 - 1$.

Before we prove the theorem, we need the following statement from linear algebra. 

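Lemma 3.2 (Jordan normal form). [HJ85] For every matrix $T \in \mathcal{M}_D$ there is an invertible
matrix R such that

(9) $T = R \left( \bigoplus_{\ell} J_\ell(\lambda_\ell) \right) R^{-1} = R J R^{-1}$,

with

(10) $J_\ell(\lambda) = \begin{pmatrix} \lambda & 1 & & \\ & \lambda & \ddots & \\ & & \ddots & 1 \\ & & & \lambda \end{pmatrix} \in \mathcal{M}_{d_\ell}$,

where the matrix J is called the Jordan normal form of T and the $J_\ell(\lambda)$ are the Jordan
blocks of size $d_\ell$ corresponding to the eigenvalue $\lambda$. The number of Jordan blocks with the
eigenvalue $\lambda$ is its geometric multiplicity, while the sum of the dimensions $\sum_{\ell:\,\lambda_\ell = \lambda} d_\ell$
is the algebraic multiplicity of the eigenvalue.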

If we express the property of S being a root of a CDC in terms of the representation 
matrix, we can establish the upper bound in the following way: 



Proof of Theorem 3.1. We consider $T_\sigma$ and S as operators on $\mathcal{M}_d$, a space of dimension
$D = d^2$. The eigenvalues of the CDC $T_\sigma$ are 1 and 0. Hence for any eigenvalue $\lambda$ of
S we have $\lambda^k \in \{0, 1\}$. Since S is unital, 1 is an eigenvalue of S, and since 1 is a simple
eigenvalue of $T_\sigma$, the eigenvalue 1 of S is also simple and no other roots of unity can occur;
so S likewise has only the eigenvalues 1 and 0.
Hence the algebraic multiplicity of the eigenvalue 0 of S is $d^2 - 1$. The only remaining
question is the decomposition of this dimension into Jordan blocks. The root order will be
the smallest k such that $J_\ell(0)^k = 0$ for all $\ell$. The smallest power k for which $J_\ell(0)^k = 0$ is
$d_\ell$. Hence the root order is the dimension of the largest Jordan block. Clearly, this becomes
largest when there is only one block, i.e., when $k = d^2 - 1$. □
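The counting argument of the proof can be illustrated on the level of matrices alone. The sketch below builds the direct sum of a 1x1 block for the eigenvalue 1 and nilpotent Jordan blocks of prescribed sizes, and then searches for the smallest power that equals the rank-one projector representing the CDC. The function names and the chosen block partitions are illustrative assumptions; complete positivity is not addressed here.

```python
def mat_mul(a, b):
    """Product of two square integer matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def jordan_root(block_sizes):
    """Direct sum of the 1x1 block (1) and nilpotent Jordan blocks J(0)."""
    dim = 1 + sum(block_sizes)
    s = [[0] * dim for _ in range(dim)]
    s[0][0] = 1
    offset = 1
    for size in block_sizes:
        for i in range(size - 1):
            s[offset + i][offset + i + 1] = 1   # ones on the superdiagonal
        offset += size
    return s

def root_order(block_sizes):
    """Smallest k such that S^k equals the projector diag(1, 0, ..., 0)."""
    s = jordan_root(block_sizes)
    dim = len(s)
    target = [[1 if i == j == 0 else 0 for j in range(dim)] for i in range(dim)]
    power, k = s, 1
    while power != target:
        power = mat_mul(power, s)
        k += 1
    return k
```

For a qubit (D = 4) the partitions `[3]`, `[2, 1]` and `[1, 1, 1]` of the nilpotent part give orders 3, 2 and 1, so the order is the size of the largest block and is maximal for a single block of size d^2 - 1.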

Although the above theorem gives an explicit upper bound for the order of a root of a
CDC, it does not answer the question whether this bound is attained by some channel S.
In order to reach this bound we additionally need to care about complete positivity of S,
which cannot be decided solely from the representation matrix of S. However, we will see
in the next sections that there always exist channels which attain the upper bound $d^2 - 1$
for the order of a root of the bistochastic CDC. Since for the bistochastic CDC $T_{1/d\,*}$ and
$T_{1/d}$ are represented by the same map, we omit the * that distinguishes between
Heisenberg and Schrödinger picture for the rest of this section.

3.2. All roots of the bistochastic qubit CDC. The most elementary case arises if we
consider the input and output system to be qubits. According to Theorem 3.1 the highest
possible order of a CDC-root is three in this case. We will give an explicit characterization
of the whole set of maximal qubit roots if $T_\sigma$ is the bistochastic CDC, i.e. $\sigma = \frac{1}{2} 1$. Thereby
we verify that there are indeed roots of the CDC of order three.

In reference [RSW02] it is proven that every bistochastic qubit channel can be decomposed
as

(11) $T(\rho) = U_1 \Lambda[U_2 \rho U_2^*] U_1^*$,

where $U_1$ and $U_2$ are unitaries and $\Lambda$ is a Pauli diagonal channel, i.e. $\Lambda[\sigma_i] = \lambda_i \sigma_i$ with $i \in
\{0, 1, 2, 3\}$. If we represent the set of possible qubit states $\rho$ via the Bloch sphere, $\rho(\vec r) =
\frac{1}{2}(1 + \vec r \cdot \vec\sigma)$, $\|\vec r\| \le 1$, and translate the action of the maps induced by $U_1$, $U_2$ and $\Lambda$ to maps
acting on $\vec r$, we find that an arbitrary bistochastic qubit channel can be written as a composition of a
diagonal map $L = \mathrm{diag}(\lambda_1, \lambda_2, \lambda_3)$ and two rotations:

(12) $T(\rho(\vec r)) = \frac{1}{2}\left(1 + (R_1 L R_2 \vec r) \cdot \vec\sigma\right)$, $R_i \in SO(3)$.

Clearly, since the $R_i$ are invertible, the rank of the linear map T is determined by the choice
of the values $\{\lambda_i\}$ of the diagonal map L. Thus, the rank of L completely determines the
order of a potential root T. Indeed, let us assume T is a k-th order root of the CDC; then
k = 3 if the rank of L is two and k = 2 if the rank is one, as can be seen from the
corresponding Jordan normal forms.

To explore the possible configurations of the $\{\lambda_i\}$ resulting in completely positive maps
we need to have a closer look at the set of Pauli diagonal channels. It is a well-known
fact that the set of possible Pauli diagonal channels forms a tetrahedron [NR07]. To get
a suitable characterization of the rank of the channels, we use the fact that every Pauli
diagonal channel has a Kraus decomposition of the form $T(X) = \sum_{i=0}^{3} \mu_i \sigma_i X \sigma_i$. Some
straightforward calculations show that these channels are indeed Pauli diagonal with the
relations

(13) $\lambda_0 = \mu_0 + \mu_1 + \mu_2 + \mu_3$
     $\lambda_1 = \mu_0 + \mu_1 - \mu_2 - \mu_3$
     $\lambda_2 = \mu_0 - \mu_1 + \mu_2 - \mu_3$
     $\lambda_3 = \mu_0 - \mu_1 - \mu_2 + \mu_3$

between $\{\mu_i\}$ and $\{\lambda_i\}$. If, on the other hand, we consider a map T in terms of the $\lambda_i$,
we can solve (13) for the $\mu_i$ to get a Kraus decomposition of T. It is easy to see that the
eigenvectors of the Choi operator $\mathcal{C}_T := T \otimes \mathrm{id}(|\Omega\rangle\langle\Omega|)$ are the vectors $|\Omega_i\rangle := (1 \otimes \sigma_i)|\Omega\rangle$
with corresponding eigenvalue $\mu_i$. Hence, complete positivity of T is expressed by the
condition that all $\mu_i$ are non-negative numbers, and unitality of T requires that the $\mu_i$ add up to
one, i.e. $\lambda_0 = 1$. This yields the inequalities

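(14) $\lambda_1 + \lambda_2 + \lambda_3 \ge -1$
     $\lambda_1 - \lambda_2 - \lambda_3 \ge -1$
     $-\lambda_1 + \lambda_2 - \lambda_3 \ge -1$
     $-\lambda_1 - \lambda_2 + \lambda_3 \ge -1$

[Figure 2: tetrahedron whose vertices are labelled $T(X) = \sigma_0 X \sigma_0$, $T(X) = \sigma_1 X \sigma_1$, $T(X) = \sigma_2 X \sigma_2$ and $T(X) = \sigma_3 X \sigma_3$.]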



Figure 2. The tetrahedron of Pauli diagonal channels parameterized by
the eigenvalues $\lambda_i$ with i = 1, 2, 3. The extremal points of the tetrahedron
represent the configuration where exactly one of the Kraus weights
$\mu_i$ equals one. The squares inside the tetrahedron mark the configurations
where one of the eigenvalues of the channel is equal to zero. The
bistochastic CDC is represented by the intersection point of all three
squares.

characterizing the tetrahedron formed by the set of admissible $\{\lambda_i\}$. These relations imply
that the set of Pauli diagonal channels with one eigenvalue equal to zero can be represented
by squares inside the tetrahedron (see Figure 2).
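The relation between the eigenvalues and the Kraus weights of a Pauli diagonal channel is easy to explore numerically. The following sketch inverts relation (13) for lambda_0 = 1 and tests the tetrahedron inequalities (14) through non-negativity of the weights. The function names and the tolerance `tol` are illustrative assumptions, not part of the paper.

```python
def kraus_weights(lam1, lam2, lam3):
    """Invert relation (13) with lambda_0 = 1; returns (mu_0, mu_1, mu_2, mu_3)."""
    return (
        (1 + lam1 + lam2 + lam3) / 4,
        (1 + lam1 - lam2 - lam3) / 4,
        (1 - lam1 + lam2 - lam3) / 4,
        (1 - lam1 - lam2 + lam3) / 4,
    )

def is_completely_positive(lam1, lam2, lam3, tol=1e-12):
    """Inequalities (14): every Kraus weight must be non-negative."""
    return all(mu >= -tol for mu in kraus_weights(lam1, lam2, lam3))

def kraus_rank(lam1, lam2, lam3, tol=1e-12):
    """Number of non-vanishing Kraus weights."""
    return sum(1 for mu in kraus_weights(lam1, lam2, lam3) if abs(mu) > tol)

def linear_rank(lam1, lam2, lam3, tol=1e-12):
    """Rank of the Bloch-vector map L = diag(lam1, lam2, lam3)."""
    return sum(1 for lam in (lam1, lam2, lam3) if abs(lam) > tol)
```

For instance, the midpoint of an edge, (1, 0, 0), has Kraus rank 2 and linear rank 1, while (0, 0.5, 0.5) lies on a square with Kraus rank 3 and linear rank 2; the point (1, 1, -1) lies outside the tetrahedron.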

Furthermore, we find an interesting connection between the Kraus rank of a channel and 
the rank as a linear map for qubit channels. Starting at an extremal point of the tetrahedron 
we find a reversible channel with Kraus rank one. If we move on the line between two 
extremal points we increase the Kraus rank by one under preservation of the reversibility. 
In the middle of the line the rank is lowered by two, because it lies on the intersection of 
two squares. Hence we need at least three Kraus operators to construct a qubit channel 
with rank exactly one less than maximal. Since the Pauli diagonal map characterizes the 
rank completely this holds for all bistochastic qubit channels. 

The following theorem gives a complete characterization of maximal roots for qubit 
systems: 

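Theorem 3.3 (Maximal qubit roots). A bistochastic channel $T : \mathcal{S}(\mathbb{C}^2) \to \mathcal{S}(\mathbb{C}^2)$ is a
maximal root of the bistochastic CDC iff T is of the form

(15) $T(\rho(\vec r)) = \frac{1}{2}\left(1 + (R_2^{-1} R L R_2 \vec r) \cdot \vec\sigma\right)$, $R \in SO(3)$,

where L has exactly two non-zero eigenvalues, $R_2 \in SO(3)$ is an arbitrary rotation and
there exist angles $\phi, \theta \in [0, 2\pi)$ such that

(16) $R = \begin{pmatrix} 0 & \cos\theta & -\sin\theta \\ \sin\phi & \cos\phi \sin\theta & \cos\phi \cos\theta \\ -\cos\phi & \sin\phi \sin\theta & \sin\phi \cos\theta \end{pmatrix}$.

If we choose $L = \mathrm{diag}(0, \lambda_2, \lambda_3)$ we have the restrictions $|\lambda_2 \pm \lambda_3| \le 1$ for complete
positivity and $\tan\phi = -\frac{\lambda_2}{\lambda_3} \tan\theta$.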

Proof. As already mentioned, for a maximal root of the bistochastic CDC we have to
choose the Pauli diagonal map L in (12) such that it has exactly two non-zero eigenvalues.
Without loss of generality we assume $L = \mathrm{diag}(0, \lambda_2, \lambda_3)$. In the following, we determine
all roots of the bistochastic CDC whose decomposition (12) is such that $R_2 = 1$. The
general case reduces to this particular setting by the following argument: If $R_1 L R_2$ is a
general root of the bistochastic CDC we have

(17) $0 = (R_1 L R_2)^n = R_2^{-1} (R_2 R_1 L)^n R_2$,

which means that $R L$, with $R = R_2 R_1$, is a root as well. On the other hand, if $R_1 L$ is a
root then

(18) $0 = (R_1 L)^n = R_2 (R_2^{-1} R_1 L R_2)^n R_2^{-1}$,

and hence $R_2^{-1} R_1 L R_2$ is again a root.

Hence, the task is to determine all three-dimensional orthogonal matrices R such that
the composition

(19) $D_T = R \cdot \mathrm{diag}(0, \lambda_2, \lambda_3)$

is nilpotent. This is equivalent to saying that all eigenvalues of $D_T$ equal zero. Since $D_T$
is of the form

(20) $D_T = \begin{pmatrix} 0 & \lambda_2 r_{12} & \lambda_3 r_{13} \\ 0 & \lambda_2 r_{22} & \lambda_3 r_{23} \\ 0 & \lambda_2 r_{32} & \lambda_3 r_{33} \end{pmatrix}$,

its characteristic polynomial can be written as $\chi(z) = \det(D_T - z 1) = -z \left(z^2 - z\, \mathrm{tr}(A) + \det(A)\right)$,
where we introduced the submatrix

(21) $A = \begin{pmatrix} \lambda_2 r_{22} & \lambda_3 r_{23} \\ \lambda_2 r_{32} & \lambda_3 r_{33} \end{pmatrix}$.

Hence, the condition that all three eigenvalues of $D_T$ equal zero is equivalent to the
condition:

(22) $\det(A) = 0$ and $\mathrm{tr}(A) = 0$.

The first part of the condition is already satisfied if the 2x2 submatrix of R consisting of
$\{r_{22}, r_{23}, r_{32}, r_{33}\}$ has a vanishing determinant. Therefore we choose the following ansatz:

(23) $R = \begin{pmatrix} \cdot & \cdot & \cdot \\ \cdot & a & b \\ \cdot & z \cdot a & z \cdot b \end{pmatrix}$, $a, b, z \in \mathbb{R}$.

We get additional restrictions on {a, b, z} from the orthogonality of R, that is, the rows of
R have to be normalized and mutually orthogonal. This leads to the conditions

(24) $a^2 + b^2 \le 1$,
     $z^2 (a^2 + b^2) \le 1$

and

(25) $(z^2 + 1)(a^2 + b^2) = 1$.

These relations suggest a parametrization in terms of trigonometric functions. If we expand
the rows and columns accordingly, we find that

(26) $R = \begin{pmatrix} 0 & \cos\theta & -\sin\theta \\ \sin\phi & \cos\phi \sin\theta & \cos\phi \cos\theta \\ -\cos\phi & \sin\phi \sin\theta & \sin\phi \cos\theta \end{pmatrix}$ with $\theta, \phi \in \mathbb{R}$,

is a possible parametrization of all $R \in SO(3)$ which fulfill the first condition of (22). The
second condition fixes a relation between the two variables, such that $\theta$ can be an arbitrary
angle and $\phi$ has to satisfy $\tan\phi = -\frac{\lambda_2}{\lambda_3} \tan\theta$.

□
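The nilpotency condition derived in the proof can be verified numerically for a concrete choice of parameters. In the sketch below, the second and third column of R follow the trigonometric parametrization of the proof, with tan(phi) = -(lambda_2 / lambda_3) tan(theta). The first column is obtained from the cross product of the other two, which guarantees det R = +1; this is a choice made for the sketch, and it has no influence on R L because the first column of R L vanishes. The function names and the values theta = 0.7, lambda_2 = 0.5, lambda_3 = 0.4 are hypothetical.

```python
import math

def cross(u, v):
    """Cross product of two 3-vectors."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def mat_mul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def maximal_root_bloch_matrix(theta, lam2, lam3):
    """Return (R, D_T) with D_T = R * diag(0, lam2, lam3) on Bloch vectors."""
    phi = math.atan(-(lam2 / lam3) * math.tan(theta))   # tr(A) = 0 condition
    c, s = math.cos(theta), math.sin(theta)
    cp, sp = math.cos(phi), math.sin(phi)
    col2 = (c, cp * s, sp * s)
    col3 = (-s, cp * c, sp * c)
    col1 = cross(col2, col3)            # assumption: fixes det R = +1
    r = [[col1[i], col2[i], col3[i]] for i in range(3)]
    lam = (0.0, lam2, lam3)
    d_t = [[r[i][j] * lam[j] for j in range(3)] for i in range(3)]
    return r, d_t

r, d_t = maximal_root_bloch_matrix(0.7, 0.5, 0.4)
d_sq = mat_mul(d_t, d_t)
d_cube = mat_mul(d_sq, d_t)
```

With these parameters `d_cube` vanishes up to rounding while `d_sq` does not, so the Bloch-vector map is nilpotent of order three, as expected for a maximal qubit root.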



3.3. Roots via perturbation. Inspired by the approach of the last section, one might try 
to imitate the construction of a maximal root via the composition of a rank-lowering and 
rotation maps in higher dimensions. For instance, one could replace the Pauli matrices by 
Weyl operators and consequently compose a rank-lowering Weyl diagonal map with some 
unitary channel representing the rotation. This procedure fails mainly due to a lack of a
Bloch-sphere interpretation of the state space and the dimensional gap between $SU(d)$ and
$SO(d^2 - 1)$, i.e., the unitary channels do not cover all possible $SO(d^2 - 1)$-rotations of
the state space in dimensions beyond d = 2.

Nevertheless, our aim is to construct maximal roots of the bistochastic CDC for arbitrary 
system dimension d. For this purpose, we recall the fact that the Jordan normal form of
a maximal root is, in an appropriate basis, given by the direct sum of a projector on the
maximally mixed state $\sigma = \frac{1}{d} 1$, representing the bistochastic CDC, and a maximal Jordan
block to the eigenvalue zero. The key idea is now to consider this nilpotent Jordan block
as an $\varepsilon$-weighted perturbation of the bistochastic CDC in such a way that the complete
positivity remains untouched. The following theorem shows that this is indeed possible.

Theorem 3.4. Let $B := \{A_1 = 1, A_2, \dots, A_{d^2}\}$ be a basis of hermitian operators in
$\mathcal{M}_d$ and $B^* = \{A^1 = \frac{1}{d} 1, A^2, \dots, A^{d^2}\}$ its dual basis with respect to the Hilbert-Schmidt
scalar product, $\mathrm{tr}(A^{i*} A_j) = \delta_{ij}$. Then, for small enough $\varepsilon$, the map

(27) $T_\varepsilon(X) = \frac{1}{d} 1\, \mathrm{tr} X + \varepsilon \sum_{i=2}^{d^2-1} A_i\, \mathrm{tr}(A^{i+1\,*} X)$

is completely positive and therefore a root of the bistochastic CDC of maximal order $d^2 - 1$.

Proof. First we emphasize that the dual basis $B^*$ is hermitian as well. To verify this,
consider the $A_i$ as basis for the real vector space of all hermitian operators and construct
the unique dual basis $A^i$ as a linear combination of the $A_i$ with real coefficients. The key
point is to choose $\varepsilon$ in such a way that $T_\varepsilon$ becomes completely positive. A way to establish
the existence of such an $\varepsilon$ is to consider the Choi operator $\mathcal{C}_{T_\varepsilon}$ and choose $\varepsilon$ such that $\mathcal{C}_{T_\varepsilon}$
is positive. A straightforward calculation yields

(28) $\mathcal{C}_{T_\varepsilon} = \frac{1}{d^2} 1 + \frac{\varepsilon}{d} \sum_{i=2}^{d^2-1} A_i \otimes \overline{A^{i+1}}$,

where $\overline{A^i}$ denotes the complex conjugate of $A^i$ in a fixed basis of $\mathbb{C}^d$. Hence, the map $T_\varepsilon$
is completely positive if $\varepsilon$ is small enough to ensure

(29) $-1 \le \varepsilon d \sum_{i=2}^{d^2-1} A_i \otimes \overline{A^{i+1}} =: \rho_\varepsilon$.

Since the chosen basis of $\mathcal{M}_d$ and its dual are hermitian, this amounts to a comparison
between the eigenvalues of the hermitian operators $\rho_\varepsilon$ and 1. The continuity of the eigenvalue
distribution of $\rho_\varepsilon$ assures the existence of a proper $\varepsilon$ for arbitrary d and therefore
guarantees the complete positivity of $T_\varepsilon$ for small enough $\varepsilon$. □
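For a qubit the perturbative construction can be made fully explicit. The sketch below assumes the Pauli basis A_1 = 1, A_2 = sigma_1, A_3 = sigma_2, A_4 = sigma_3 with dual basis A^i = A_i / 2, which is one admissible choice and not prescribed by the paper. It applies the perturbed map repeatedly to an operator and tests strict positivity of the Choi matrix with a hand-written Cholesky factorization; all function names are illustrative.

```python
I2 = [[1, 0], [0, 1]]
SX = [[0, 1], [1, 0]]
SY = [[0, -1j], [1j, 0]]
SZ = [[1, 0], [0, -1]]
BASIS = [I2, SX, SY, SZ]                                      # A_1 .. A_4
DUAL = [[[v / 2 for v in row] for row in a] for a in BASIS]   # A^i = A_i / 2

def mat_mul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def trace(a):
    return sum(a[i][i] for i in range(len(a)))

def t_eps(x, eps):
    """T_eps(X) = (1/2) 1 tr X + eps * sum_{i=2}^{3} A_i tr(A^{i+1} X)."""
    out = [[trace(x) / 2 * I2[r][c] for c in range(2)] for r in range(2)]
    for i in (1, 2):                        # zero-based positions of A_2, A_3
        coeff = eps * trace(mat_mul(DUAL[i + 1], x))   # dual basis is hermitian
        for r in range(2):
            for c in range(2):
                out[r][c] += coeff * BASIS[i][r][c]
    return out

def apply_n(x, eps, n):
    for _ in range(n):
        x = t_eps(x, eps)
    return x

def choi(eps):
    """Choi matrix (1/2) sum_kl T(E_kl) (x) E_kl as a 4x4 nested list."""
    c = [[0j] * 4 for _ in range(4)]
    for k in range(2):
        for l in range(2):
            e_kl = [[1 if (r, s) == (k, l) else 0 for s in range(2)]
                    for r in range(2)]
            t = t_eps(e_kl, eps)
            for n in range(2):
                for m in range(2):
                    c[2 * n + k][2 * m + l] = t[n][m] / 2
    return c

def is_positive_definite(h, tol=1e-12):
    """Cholesky test: succeeds iff the hermitian matrix h is positive definite."""
    n = len(h)
    low = [[0j] * n for _ in range(n)]
    for j in range(n):
        pivot = (h[j][j] - sum(abs(low[j][k]) ** 2 for k in range(j))).real
        if pivot <= tol:
            return False
        low[j][j] = pivot ** 0.5
        for i in range(j + 1, n):
            s = h[i][j] - sum(low[i][k] * low[j][k].conjugate() for k in range(j))
            low[i][j] = s / low[j][j]
    return True
```

In this basis sigma_3 is mapped to eps sigma_2, then to eps^2 sigma_1, and finally to zero, so the third power of the map acts like the bistochastic CDC while the second does not. The Cholesky test succeeds for eps = 0.4 and fails for eps = 0.6, which illustrates that complete positivity only holds for sufficiently small eps.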



The construction scheme presented in the proof of Theorem 3.4 also applies to all CDCs
$T_\sigma$ such that $\sigma$ has full rank. If, however, the rank r of $\sigma$ is less than maximal, this
construction only leads to a root of order $r^2 - 1$.

Of course, for a given basis $A_i$ and its dual $A^i$ there is an explicit bound on $\varepsilon$ in terms
of the eigenvalues of $\rho_\varepsilon$. This bound seems to be decreasing with the dimension d, due
to the enlargement of the spectral radius in (29) when going to larger dimensions. This
suggests that all maximal roots of the bistochastic CDC in higher dimensions obtained
by this construction get closer to the bistochastic CDC with increasing d. The following
proposition refutes this statement:

Proposition 3.5. For arbitrary dimension $d \in \mathbb{N}$ there is always a maximal root $T_\varepsilon$ of the
bistochastic CDC such that their cb-norm distance satisfies

(30) $\|T_\varepsilon - T_{1/d}\|_{cb} \ge \frac{d-1}{d}$.

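Proof. We start with a further specification of the basis $A_i$ and choose $\|A_3\|_\infty = 1$. This
implies the following lower bound for the cb-norm difference between $T_\varepsilon$ and the bistochastic
CDC:

(31) $\|T_\varepsilon - T_{1/d}\|_{cb} \ge \sup_A \frac{\|T_\varepsilon(A) - T_{1/d}(A)\|_\infty}{\|A\|_\infty} \ge \|T_\varepsilon(A_3) - T_{1/d}(A_3)\|_\infty$.

Since $T_\varepsilon(A_3) = \varepsilon A_2$ and $T_{1/d}(A_3) = 0$, the right-hand side equals $|\varepsilon|\, \|A_2\|_\infty$.
Hence, the cb-norm is bounded from below by $(d-1)/d$ if we are free to choose $\varepsilon \|A_2\|_\infty =
(d-1)/d$. Of course, this choice should not violate the complete positivity of $T_\varepsilon$.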

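In order to show that this is indeed possible with some further restrictions on the choice
of $A_3$, we consider the transformation $A_i \mapsto \delta^{-i+3} A_i$ and $A^i \mapsto \delta^{i-3} A^i$ with $\delta \in \mathbb{R} \setminus \{0\}$
and $i \ge 3$, which still gives a maximal root of the bistochastic CDC. The Choi operator
$\mathcal{C}_{T_\varepsilon}$ transforms under this map according to

(32) $\mathcal{C}_{T_\varepsilon} \mapsto \frac{1}{d^2} 1 + \frac{\varepsilon}{d} A_2 \otimes \overline{A^3} + \delta\, \frac{\varepsilon}{d} \sum_{i=3}^{d^2-1} A_i \otimes \overline{A^{i+1}}$.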

Since we get a maximal root of the bistochastic CDC for arbitrarily small but non-zero
$\delta$, perturbation theory tells us that we only have to compare the eigenvalues of the first
two parts of the sum when considering the limit $\delta \to 0$. In other words, we may neglect
the influence of terms $A_i \otimes \overline{A^{i+1}}$ with $i > 2$ on the eigenvalue problem if $\delta$ is chosen
sufficiently small. Hence, we have to establish the inequality

(33) $-1 \le d\, \varepsilon\, A_2 \otimes \overline{A^3}$.

By taking the operator norm on both sides of the inequality, we find the following sufficient
criterion for complete positivity of $T_\varepsilon$:

(34) $d\, |\varepsilon|\, \|A_2\|_\infty \|A^3\|_\infty \le 1$.

To obtain the restrictions imposed on $A_3$ by this inequality, we furthermore assume the
basis B to be orthogonal, that is, $A^i = A_i / \mathrm{tr} A_i^2$. We denote the eigenvalues of $A_3$ by
$a_3^i$; then, orthogonality to $A_1 = 1$ requires $\sum_i a_3^i = 0$. Additionally, $A_3$ must satisfy
$\max_i |a_3^i| = 1$ to guarantee $\|A_3\|_\infty = 1$. With these notations (34) changes into

(35) $|\varepsilon|\, \|A_2\|_\infty \le \frac{\sum_i |a_3^i|^2}{d}$.

Thus, equation (33) is satisfied and $T_\varepsilon$ is completely positive if we choose $A_3$ such that
$d - 1 \le \sum_i |a_3^i|^2$. This can be satisfied in even dimensions by choosing $a_3^i = (-1)^i$,
actually leading to a lower bound of 1 for the cb-norm difference. In odd dimensions we
choose $a_3^i = (-1)^i$ for $i < d$ and $a_3^d = 0$. □
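The spectrum chosen for A_3 at the end of the proof can be checked against the three requirements used above: vanishing trace, operator norm one, and a sum of squared eigenvalues of at least d - 1. The following sketch does this for small dimensions; the function names are illustrative and the index i runs from 1 to d as in the text.

```python
def a3_spectrum(d):
    """Eigenvalues a_3^i, i = 1..d, of A_3 as chosen at the end of the proof."""
    if d % 2 == 0:
        return [(-1) ** i for i in range(1, d + 1)]
    return [(-1) ** i for i in range(1, d)] + [0]

def satisfies_requirements(d):
    a = a3_spectrum(d)
    traceless = sum(a) == 0                          # orthogonality to A_1 = 1
    normalized = max(abs(v) for v in a) == 1         # operator norm of A_3 is 1
    large_enough = sum(v * v for v in a) >= d - 1    # d - 1 <= sum_i |a_3^i|^2
    return traceless and normalized and large_enough

def norm_bound(d):
    """Right-hand side of (35): largest admissible value of |eps| * ||A_2||."""
    return sum(v * v for v in a3_spectrum(d)) / d
```

In even dimensions `norm_bound(d)` equals 1, and in odd dimensions it equals (d - 1)/d, matching the lower bounds on the cb-norm distance stated in the proof.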



4. Finitely correlated construction of k-dependent states

The general concept of finitely correlated states and the occurring correlations are considered
in reference [FNW92]. We want to deal with the correlations that occur if a maximal
CDC-root S is used to generate a functional on the infinite spin chain. For this purpose
we consider the quasi-local algebra $\mathcal{A} := \bigotimes_{z=-\infty}^{\infty} \mathcal{A}_z$ generated by algebras of finite subsets
$\mathcal{A}_\Lambda := \bigotimes_{z \in \Lambda} \mathcal{A}_z$ on finite chain elements $\Lambda \subset \mathbb{Z}$. A k-dependent state $\omega$ is defined
in the following way [Pet90, Mat98]:

Definition 4.1 (k-dependent state). A state $\omega : \mathcal{A} \to \mathbb{C}$ is called k-dependent if algebras
separated by k or more sites are independent, i.e.

(36) $\omega\big(A_{(-\infty,n]} \otimes \underbrace{1 \otimes \dots \otimes 1}_{k\text{-times}} \otimes A_{[n+k+1,\infty)}\big) = \omega\big(A_{(-\infty,n]}\big)\, \omega\big(A_{[n+k+1,\infty)}\big)$.

To understand the application of maximal CDC-roots in this context, we first recall that
every channel $T : \mathcal{M}_{d'} \to \mathcal{M}_d$ admits a Stinespring representation [Sti55], i.e.

(37) $T(X) = V^* (X \otimes 1_k) V \quad \forall X \in \mathcal{M}_{d'}$,

where $V : \mathbb{C}^d \to \mathbb{C}^{d'} \otimes \mathbb{C}^k$ is an isometry, that is, $V^* V = 1_d$. Since we want to concatenate
the channel T, input dimension $d = \dim(\mathcal{H}_{in})$ and output dimension $d' = \dim(\mathcal{H}_{out})$ will
be equal. Furthermore, the dimension k of the ancilla system is equal to the Kraus rank of
T. Keeping this representation in mind, we define for every $A \in \mathcal{M}_k$ the following map
$\mathbb{E}_A : \mathcal{M}_d \to \mathcal{M}_d$:

(38) $\mathbb{E}_A(X) = V^* (X \otimes A) V$,

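where the isometry V is chosen to be the same as in (37) and therefore $\mathbb{E}_1(X) = T(X)$
holds. The concatenation of several $\mathbb{E}_{A_i}$, with $i = 1, \dots, n$, together with any $\rho \in \mathcal{S}(\mathbb{C}^d)$
defines a functional $\omega_n : \mathcal{M}_k^{\otimes n} \to \mathbb{C}$ via

(39) $\omega_n(A_1 \otimes A_2 \otimes \dots \otimes A_n) = \mathrm{tr}\big(\rho\, \mathbb{E}_{A_1} \circ \mathbb{E}_{A_2} \circ \dots \circ \mathbb{E}_{A_n}(1)\big)$.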

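Due to the unitality of T, we can extend this functional to the positive half chain $\mathcal{A}_+ :=
\bigotimes_{i=0}^{\infty} \mathcal{M}_k$ via:

(40) $\omega_{n+1}(A_{[1,n]} \otimes 1) = \mathrm{tr}\big(\rho\, \mathbb{E}_{A_1} \circ \mathbb{E}_{A_2} \circ \dots \circ \mathbb{E}_{A_n} \circ \mathbb{E}_1(1)\big)$
     $= \mathrm{tr}\big(\rho\, \mathbb{E}_{A_1} \circ \mathbb{E}_{A_2} \circ \dots \circ \mathbb{E}_{A_n}(1)\big)$
     $= \omega_n(A_{[1,n]})$.

Furthermore, if we choose $\rho$ as an invariant state of T, i.e. $\mathrm{tr}(\rho\, \mathbb{E}_1(X)) = \mathrm{tr}(\rho X)$, we can
extend the functional also to the negative half-chain $\mathcal{A}_-$, the corresponding tensor product of
copies of $\mathcal{M}_k$ over the sites extending to $-\infty$, through setting:

(41) $\omega_{n+1}(1 \otimes A_{[1,n]}) = \mathrm{tr}\big(\rho\, \mathbb{E}_1 \circ \mathbb{E}_{A_1} \circ \mathbb{E}_{A_2} \circ \dots \circ \mathbb{E}_{A_n}(1)\big)$
     $= \omega_n(A_{[1,n]})$.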

Combining these two extensions we define a functional $\omega$ on the infinite spin chain $\mathcal{A}$. If we
furthermore define the shift operator $\alpha$ by setting

(42) $\alpha : \mathcal{A} \to \mathcal{A}$, $\alpha(A_1 \otimes \dots \otimes A_n \otimes 1) = 1 \otimes A_1 \otimes \dots \otimes A_n$,

we find that $\omega$ is a translation invariant state, i.e. $\omega = \omega \circ \alpha$. We refer to $\omega$ as the finitely
correlated state generated by $(T, \rho)$.

Now we choose the generating channel $T$ as a $k$-th root of the bistochastic CDC with $k = d^2 - 1$, that is, $E_1^{d^2-1}(X) = \frac{1}{d}\operatorname{tr}(X)\,1$. Our construction scheme for roots of the bistochastic CDC yields channels $T$ for which the maximally mixed state is an invariant state, hence we choose $\rho = \frac{1}{d}1$. This means we construct a finitely correlated state on an infinite spin chain with a certain dependency length. This length is equal to the order of the root, as the following calculation for the resulting functional $\omega$ shows:

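(43) $\omega\bigl(\dots \otimes A_n \otimes \underbrace{1 \otimes \dots \otimes 1}_{(d^2-1)\text{-times}} \otimes A_{n+d^2} \otimes \dots\bigr)$
$\qquad = \operatorname{tr}\bigl(\rho \dots \circ E_{A_n} \circ E_1^{d^2-1} \circ E_{A_{n+d^2}} \circ \dots (1)\bigr)$
$\qquad = \operatorname{tr}\bigl(\rho \dots \circ E_{A_n}(1)\bigr)\,\operatorname{tr}\bigl(\rho\, E_{A_{n+d^2}} \circ \dots (1)\bigr)$
$\qquad = \omega\bigl(A_{(-\infty,n]}\bigr)\,\omega\bigl(A_{[n+d^2,\infty)}\bigr).$

This satisfies the form of (36) and therefore the maximal roots generate $(d^2-1)$-dependent states on an infinite spin chain, where $d$ is the dimension of the Hilbert space on which $\rho$ acts.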
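The link between the order of the root and the dependency length is purely a statement about nilpotency, which can be illustrated with a few lines of linear algebra. The sketch below rests on assumptions of our own: the channel is represented in an operator basis whose first element is the identity, its matrix is taken to be block diagonal $\mathrm{diag}(1, J)$ with $J$ a single scaled nilpotent Jordan block of size $d^2-1$, and complete positivity is not checked here. The function names are hypothetical.

```python
from fractions import Fraction

def matmul(a, b):
    """Multiply two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][m] * b[m][j] for m in range(n)) for j in range(n)]
            for i in range(n)]

def root_matrix(d, eps):
    """diag(1, J): J is one nilpotent Jordan block of size d^2 - 1, scaled by eps.
    Assumption: J acts on the traceless operators in a suitable basis."""
    size = d * d
    D = [[Fraction(0)] * size for _ in range(size)]
    D[0][0] = Fraction(1)
    for i in range(1, size - 1):
        D[i][i + 1] = eps
    return D

def cdc_matrix(d):
    """Matrix of the bistochastic CDC: only the identity component survives."""
    size = d * d
    C = [[Fraction(0)] * size for _ in range(size)]
    C[0][0] = Fraction(1)
    return C

def order_of_root(D, target):
    """Smallest m >= 1 with D^m == target, or None if no power up to len(D) matches."""
    power = D
    for m in range(1, len(D) + 1):
        if power == target:
            return m
        power = matmul(power, D)
    return None

order_qubit = order_of_root(root_matrix(2, Fraction(1, 10)), cdc_matrix(2))
order_qutrit = order_of_root(root_matrix(3, Fraction(1, 10)), cdc_matrix(3))
```

Exact rational arithmetic is used so that the comparison with the CDC matrix is not affected by rounding.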

5. Memory channels 

Commonly it is assumed that successive uses of a channel are uncorrelated in the sense that identical inputs at different time steps produce identical outputs. However, almost all real physical processes exhibit some correlation in time, i.e. the transformation of the states at some time $t$ depends to some extent on the states at previous times $t' < t$. If we consider the repeated application of a quantum channel, e.g. sending photons through some fiber, these correlations can be taken into account by introducing an additional system $\mathcal{M}$, which we refer to as the memory system. For the sake of clarity we define $\mathcal{S}(\mathcal{H}_{\mathrm{in}}) =: \mathcal{A}$ for the input system and $\mathcal{S}(\mathcal{H}_{\mathrm{out}}) =: \mathcal{B}$ for the output system. Then, the $n$-fold concatenation $T_n : \mathcal{M} \otimes \mathcal{A}^{\otimes n} \to \mathcal{B}^{\otimes n} \otimes \mathcal{M}$ of the quantum memory channel $T : \mathcal{M} \otimes \mathcal{A} \to \mathcal{B} \otimes \mathcal{M}$ can be expressed via

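(44) $T_n = \bigl(\mathrm{id}_{\mathcal{B}}^{\otimes (n-1)} \otimes T\bigr) \circ \dots \circ \bigl(\mathrm{id}_{\mathcal{B}} \otimes T \otimes \mathrm{id}_{\mathcal{A}}^{\otimes (n-2)}\bigr) \circ \bigl(T \otimes \mathrm{id}_{\mathcal{A}}^{\otimes (n-1)}\bigr),$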

where $\mathrm{id}_X : X \to X$ denotes the ideal or noiseless channel on system $X$, see figure 3 for an illustration. Due to this construction, the elements of the output algebras $\mathcal{B}$ will certainly be affected by the choice of the initial memory state $\rho$ of the memory system $\mathcal{M}$.

Figure 3. A quantum memory channel $T$ and its $n$-fold concatenation $T_n$. $\mathcal{M}$ denotes the memory system, which is used to model the interaction between the different concatenation steps.

Indeed, it is a natural question to ask if all elements of the output system are influenced in the same way or if the effect of the memory dies out after sufficiently many concatenation steps. This leads to the notion of forgetful memory channels, which are those quantum memory channels where the influence of the initialization of the memory system vanishes exponentially with the number of time steps, see reference [KW05] for a precise definition. Our aim in this section is to construct forgetful memory channels where the effect of the initial memory state vanishes completely after a certain number of concatenations. We refer to such channels as strictly forgetful memory channels. The following definition expresses this in mathematical terms:

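Definition 5.1 (Strictly Forgetful Memory Channel). A quantum memory channel $T$ is strictly forgetful iff there is some $n \in \mathbb{N}$ such that

(45) $\bigl\| \operatorname{tr}_{\mathcal{B}^{\otimes n}}\bigl[T_n\bigl((\sigma_{M,1} - \sigma_{M,2}) \otimes \sigma_{\mathrm{sys}}\bigr)\bigr] \bigr\|_1 = 0$

for all $\sigma_{M,1}, \sigma_{M,2} \in \mathcal{S}(\mathcal{M})$ and $\sigma_{\mathrm{sys}} \in \mathcal{S}(\mathcal{A}^{\otimes n})$.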

This definition assumes that there is no entanglement between the initial memory state and the system. We refer to the minimal $n$ such that (45) holds as the memory depth of the channel $T$. Equivalently, the memory branch, i.e. the channel $T_M : \mathcal{S}(\mathcal{M}) \otimes \mathcal{S}(\mathcal{A}^{\otimes n}) \to \mathcal{S}(\mathcal{M})$ defined by $T_M(\sigma_M \otimes \sigma_{\mathrm{sys}}) := \operatorname{tr}_{\mathcal{B}^{\otimes n}}[T_n(\sigma_M \otimes \sigma_{\mathrm{sys}})]$, completely depolarizes the information of the memory input state, see fig. 4. Our aim is to construct strictly forgetful memory channels $T$ with memory depth $n$, exploiting the results about $n$-th order roots of the CDC. Similarly to the section about the construction of maximal roots in arbitrary dimensions, we construct the forgetful memory channels by expressing the problem in terms of matrix equations.

For this purpose we fix bases of operators on the memory channel's input, output and memory system, i.e., $\{M_i\}$ is a basis for $\mathcal{M}$ and $\{A_i\}$ respectively $\{B_i\}$ are bases of $\mathcal{A}$ respectively $\mathcal{B}$, where the number $i \in \{1, \dots, d^2\}$ of operators corresponds to the dimension $d$ of the respective Hilbert space. The matrix representation of a memory channel $T$ is then given through



(46) $\langle i,j | D_T | k,l \rangle := \operatorname{tr}\bigl(M_i^{*} \otimes B_j^{*}\; T(M_k \otimes A_l)\bigr).$

Figure 4. To characterize the strict forgetfulness of a memory channel $T$, we consider the channel's $n$-fold concatenation, where we neglect the output system $\mathcal{B}^{\otimes n}$ (depicted by the bins). The memory channel is strictly forgetful iff there is an $n \in \mathbb{N}$ such that the outputs on the memory system for any two different input states $\sigma_1, \sigma_2 \in \mathcal{S}(\mathcal{M})$ cannot be distinguished by any measurement, for every $\sigma_{\mathrm{sys}} \in \mathcal{S}(\mathcal{A}^{\otimes n})$.


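In what follows, we try to identify the parts of the matrix $D_T$ which determine the forgetfulness of the corresponding memory channel $T$. For that purpose, we first consider the identity

(47) $\operatorname{tr}_{\mathcal{B}^{\otimes 2}}[T_2(\sigma_M \otimes \sigma_1 \otimes \sigma_2)] = \operatorname{tr}_{\mathcal{B}^{\otimes 2}}\bigl[(\mathrm{id}_{\mathcal{B}} \otimes T) \circ (T \otimes \mathrm{id}_{\mathcal{A}})(\sigma_M \otimes \sigma_1 \otimes \sigma_2)\bigr]$
$\qquad = \operatorname{tr}_{\mathcal{B}}\bigl[T\bigl(\operatorname{tr}_{\mathcal{B}}[T(\sigma_M \otimes \sigma_1)] \otimes \sigma_2\bigr)\bigr].$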

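If we apply this identity to the definition of the memory branch $T_M(\sigma_M \otimes \sigma_{\mathrm{sys}})$ of the $n$-th concatenation acting on a separable (product) input state, i.e. $\sigma_{\mathrm{sys}} = \sigma_1 \otimes \dots \otimes \sigma_n$, we find the following term:

(48) $T_M(\sigma_M \otimes \sigma_{\mathrm{sys}}) = \operatorname{tr}_{\mathcal{B}^{\otimes n}}[T_n(\sigma_M \otimes \sigma_1 \otimes \dots \otimes \sigma_n)]$
$\qquad = \operatorname{tr}_{\mathcal{B}}\bigl[T\bigl(\operatorname{tr}_{\mathcal{B}}\bigl[T\bigl(\dots \operatorname{tr}_{\mathcal{B}}[T(\sigma_M \otimes \sigma_1)] \otimes \sigma_2 \dots\bigr)\bigr] \otimes \sigma_n\bigr)\bigr].$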

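By introducing the set of parametrized channels $T_{M,\sigma_i} : \mathcal{S}(\mathcal{M}) \to \mathcal{S}(\mathcal{M})$ on the memory branch, where $T_{M,\sigma_i}(\sigma_M) := \operatorname{tr}_{\mathcal{B}}[T(\sigma_M \otimes \sigma_i)]$ with $\sigma_i \in \mathcal{S}(\mathcal{A})$, we can rewrite (48) as a concatenation of parametrized channels on the memory branch:

(49) $T_M(\sigma_M \otimes \sigma_{\mathrm{sys}}) = T_{M,\sigma_n} \circ \dots \circ T_{M,\sigma_1}(\sigma_M).$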

We point out that so far we have just reformulated the description of the memory branch in terms of parametrized maps. If we now identify every $T_{M,\sigma_i}$ with its matrix representation $D_{T_M,\sigma_i}$ via the coefficients

(50) $\langle k | D_{T_M,\sigma_i} | l \rangle := \operatorname{tr}\bigl(M_k^{*} \operatorname{tr}_{\mathcal{B}}[T(M_l \otimes \sigma_i)]\bigr),$

for some basis of operators $\{M_k\}$ on the memory system, we can express (49) as a multiplication of parametrized matrices:

(51) $D_{T_M} = D_{T_M,\sigma_n} \cdot D_{T_M,\sigma_{n-1}} \cdots D_{T_M,\sigma_1}.$

Note that the left-hand side of this equation implicitly depends on the system state $\sigma_{\mathrm{sys}}$. Equation (51) turns out to be the crucial matrix equation to construct channels of memory depth $n$ utilizing the results on $n$-th order CDC-roots. Indeed, by the definition of a strictly forgetful channel $T$, the left-hand side needs to represent a completely depolarizing channel. Obviously, if the memory depth of $T$ is $n$, then $T_{M,\sigma}$ needs to be a root of a CDC of order $n_\sigma \le n$ for all $\sigma \in \mathcal{S}(\mathcal{A})$, since we may choose $\sigma_i = \sigma$ for all $i$ in (51). However, it is not enough to demand that $T_{M,\sigma}$ is a root of a CDC for all $\sigma \in \mathcal{S}(\mathcal{A})$ in order to construct a strictly forgetful memory channel. Indeed, there exist memory channels which are strictly forgetful for all system states of the form $\sigma_{\mathrm{sys}} = \sigma^{\otimes n}$ but not for general system states, see the example at the end of this section.

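The main obstacle to overcome is now that (51) needs to be completely depolarizing for all choices of the $\sigma_i$. Before we tackle this problem, let us argue that strict forgetfulness for all separable states of the input system $\mathcal{A}^{\otimes n}$ implies strict forgetfulness for all elements of $\mathcal{S}(\mathcal{A}^{\otimes n})$. Suppose $T$ is strictly forgetful for all separable states $\sigma_{\mathrm{sys}} = \sigma_1 \otimes \dots \otimes \sigma_n$ and choose an operator basis $\{\sigma_\alpha\}_{\alpha = 1, \dots, d_A^2}$ of $\mathcal{A}$ such that each $\sigma_\alpha$ is a quantum state. An arbitrary, possibly entangled, state $\rho_{\mathrm{sys}}$ can then be written as

(52) $\rho_{\mathrm{sys}} = \sum_{\alpha_1, \dots, \alpha_n} c_{\alpha_1 \dots \alpha_n}\, \sigma_{\alpha_1} \otimes \dots \otimes \sigma_{\alpha_n}.$

Let $\sigma_{M,1}$ and $\sigma_{M,2}$ be arbitrary states of the memory, then we get

(53) $\operatorname{tr}_{\mathcal{B}^{\otimes n}}[T_n(\sigma_{M,1} \otimes \rho_{\mathrm{sys}})] = \sum_{\alpha_1, \dots, \alpha_n} c_{\alpha_1 \dots \alpha_n} \operatorname{tr}_{\mathcal{B}^{\otimes n}}[T_n(\sigma_{M,1} \otimes \sigma_{\alpha_1} \otimes \dots \otimes \sigma_{\alpha_n})]$
$\qquad = \sum_{\alpha_1, \dots, \alpha_n} c_{\alpha_1 \dots \alpha_n} \operatorname{tr}_{\mathcal{B}^{\otimes n}}[T_n(\sigma_{M,2} \otimes \sigma_{\alpha_1} \otimes \dots \otimes \sigma_{\alpha_n})]$
$\qquad = \operatorname{tr}_{\mathcal{B}^{\otimes n}}[T_n(\sigma_{M,2} \otimes \rho_{\mathrm{sys}})],$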

and hence $T$ is also strictly forgetful for all $\rho \in \mathcal{S}(\mathcal{A}^{\otimes n})$. Thus, we can restrict to separable states $\sigma_{\mathrm{sys}}$ without loss of generality, which means that everything boils down to assuring that (51) equals a CDC. Hence, we need to have a closer look at the matrices $D_{T_M,\sigma_i}$ of the parametrized channels. To facilitate the derivation we choose the bases $\{M_i\}$, $\{A_i\}$ and $\{B_i\}$ to be hermitian (and so are the dual bases), with the identity as the first element of each of them. We then find for some fixed $\sigma_i$:

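(54) $\langle k | D_{T_M,\sigma_i} | l \rangle = \operatorname{tr}_{\mathcal{M}}\bigl(M_k \operatorname{tr}_{\mathcal{B}}[T(M_l \otimes \sigma_i)]\bigr) = \frac{1}{d_A}\langle k | X_{1,1} | l \rangle + d_B \sum_{r=2}^{d_A^2} a_r(\sigma_i)\, \langle k | X_{1,r} | l \rangle,$

where $X_{1,1}$ and $X_{1,r}$ collect the contributions of the basis elements $A_1 = 1$ and $A_r$ in the expansion of $\sigma_i$, and $a_r(\sigma_i)$ are the corresponding expansion coefficients.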



This equation shows that for appropriate bases every matrix on the right-hand side of (51) can be expressed as a combination of the submatrix $X_{1,1}$ and a state-specific weighted sum of the submatrices $\{X_{1,2 \dots d_A^2}\}$ of the matrix $D_T$. As already mentioned, the matrix

(55) $D_{T_M,\sigma} = \frac{1}{d_A} X_{1,1} + d_B \sum_{r=2}^{d_A^2} a_r(\sigma)\, X_{1,r}$

must necessarily represent a root of a CDC of order at most $n$ for arbitrary $\sigma$. Moreover, all $n$-fold products of matrices $D_{T_M,\sigma_i}$ with arbitrary $\sigma_i$ must also represent the CDC.









This can be assured by fixing bases and choosing the matrix blocks according to 

where the $J_i$ are upper triangular matrices of dimension $d_M^2 - 1$ and $v$ is a real-valued vector. By choosing those blocks appropriately it can be assured that the nilpotency order of the $J_i$ is $n$, which, by our results on maximal roots of the CDC, is bounded from above by $n \le d_M^2 - 1$. This choice ensures that every $D_{T_M,\sigma_i}$ in (51) is of the form

(57) $D_{T_M,\sigma_i} = \begin{pmatrix} 1 & 0 \\ v & J(\sigma_i) \end{pmatrix},$

with an upper triangular nilpotent block $J(\sigma_i)$. If we put this into the right-hand side of (51) we find that the memory branch is indeed completely depolarizing for an arbitrary input state of the form $\sigma_{\mathrm{sys}} = \sigma_1 \otimes \dots \otimes \sigma_n$, and the necessary number of concatenation steps is upper bounded by $d_M^2 - 1$.
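That products of such block matrices forget the memory input after at most $d_M^2 - 1$ factors can be checked directly. The sketch below makes assumptions of our own: a qubit memory ($d_M^2 - 1 = 3$), randomly drawn strictly upper triangular blocks and first columns, exact rational entries, and hypothetical helper names. It says nothing about complete positivity of the underlying channel.

```python
import random
from fractions import Fraction

def matmul(a, b):
    """Multiply two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][m] * b[m][j] for m in range(n)) for j in range(n)]
            for i in range(n)]

def memory_matrix(rng, m):
    """Random matrix with first row (1, 0, ..., 0), an arbitrary first column v
    and a strictly upper triangular m x m block J with nonzero entries."""
    D = [[Fraction(0)] * (m + 1) for _ in range(m + 1)]
    D[0][0] = Fraction(1)
    for i in range(1, m + 1):
        D[i][0] = Fraction(rng.randint(-5, 5), 10)      # entry of v
        for j in range(i + 1, m + 1):
            D[i][j] = Fraction(rng.randint(1, 5), 10)   # entry of J above the diagonal
    return D

def product(matrices):
    """D_n * ... * D_1 for matrices given in the order [D_1, ..., D_n]."""
    result = matrices[0]
    for D in matrices[1:]:
        result = matmul(D, result)
    return result

def forgets_memory(D):
    """True if only the first column is nonzero, so the output ignores the input."""
    return D[0][0] == 1 and all(x == 0 for row in D for x in row[1:])

rng = random.Random(0)
m = 3  # d_M^2 - 1 for a qubit memory
family = [memory_matrix(rng, m) for _ in range(5)]
depth3 = all(forgets_memory(product([x, y, z]))
             for x in family for y in family for z in family)
depth2 = any(forgets_memory(product([x, y])) for x in family for y in family)
```

Here `depth3` comes out true for every ordered triple, while `depth2` is false because the blocks were drawn with nonzero entries above the diagonal, so two factors never suffice in this family.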

What remains is to show that complete positivity is not violated if we choose the submatrices $X_{1,k}$ in the proposed way. Here we emphasize that the behavior of the memory branch is affected only by the submatrices $X_{1,k}$, and we are completely free to choose $X_{2 \dots d_B^2,k}$ to assure complete positivity. Moreover, we are free to choose the matrix blocks $J_i$ and $v$ with arbitrarily small but non-zero norm, without disturbing the forgetfulness property of $T$. Hence, the matrix blocks $J_i$ and $X_{2 \dots d_B^2,k}$ can be considered as a perturbation of the bistochastic completely depolarizing channel $T_1$ on $\mathcal{A}$, $\mathcal{B}$ and $\mathcal{M}$ defined via $T_1(\sigma_{MA}) = \frac{1}{d_B d_M} 1_{BM}$ for all $\sigma_{MA} \in \mathcal{S}(\mathcal{M}\mathcal{A})$. Since the Choi matrix of $T_1$ is proportional to the identity and hence of full rank, sufficiently small perturbations of this kind remain completely positive.

A natural question to ask is whether this construction yields all strictly forgetful memory channels or if there are examples which cannot be transformed to the case of upper triangular matrices by a clever choice of basis. It turns out that our construction indeed covers all strictly forgetful memory channels.

Theorem 5.2. Let $T : \mathcal{M} \otimes \mathcal{A} \to \mathcal{B} \otimes \mathcal{M}$ be a strictly forgetful memory channel. For appropriate bases of $\mathcal{A}$, $\mathcal{B}$ and $\mathcal{M}$ the memory branch is of the form (56). This implies that the memory depth of strictly forgetful memory channels is upper bounded by $d_M^2 - 1$.

Proof. We choose again bases $\{A_i\}$, $\{B_i\}$ and $\{M_i\}$ of $\mathcal{A}$, $\mathcal{B}$ and $\mathcal{M}$ such that the identity is the first element of the respective basis and the other elements of the basis are hermitian and tracefree. We adopt the notation of (55) and denote the matrices with elements $\langle k,1 | D_T | l,r \rangle$ by $X_{1,r}$. Let $\tilde{X}_{1,r}$ denote the matrix obtained from $X_{1,r}$ by deleting the first row and column. The first step in our proof is to verify that the memory channel $T$ is strictly forgetful with memory depth at most $n$ iff the matrix algebra generated by the matrices $\tilde{X}_{1,r}$ is nilpotent. Since we have chosen the basis $\{A_l\}$ hermitian and tracefree apart from $A_1 = 1$, there are positive numbers $r_l$ such that

(58) $\sigma = \frac{1}{d_A} A_1 + \sum_{l=2}^{d_A^2} a_l A_l$

is a quantum state for all $|a_l| < r_l$. Thus, the condition that $T_{M,\sigma_n} \circ \dots \circ T_{M,\sigma_1}$ is completely depolarizing for all $\sigma_1, \dots, \sigma_n$ implies that

(59) $\sum_{l_1, \dots, l_n} a_{l_1} \cdots a_{l_n}\, \tilde{X}_{1,l_1} \cdots \tilde{X}_{1,l_n} = 0,$

where the $a_{l_i}$ equal $1/d_A$ if $l_i = 1$ and satisfy $|a_{l_i}| < r_{l_i}$ otherwise. If we consider this as a polynomial in the variables $a_{l_i}$ with matrix-valued coefficients, we see that this equation implies that all coefficients must vanish, that is, $\tilde{X}_{1,l_1} \cdots \tilde{X}_{1,l_n} = 0$ for all $l_i$. This proves that the algebra generated by the matrices $\tilde{X}_{1,l}$ is nilpotent.

By the theorem of Jacobson [Jac62] this already implies that this algebra is simultaneously triangularizable, that is, there is a basis in which all matrices are upper triangular. The statement about the maximal memory depth of $T$ follows trivially. □



The crucial point in the proof of Theorem 5.2 is to show that the algebra generated by the reduced matrices $\tilde{X}_{1,l}$ is nilpotent. If we were only able to prove that the subspace generated by the $\tilde{X}_{1,l}$ is nilpotent, which translates into the property that the memory channel $T$ is strictly forgetful for all system states of the form $\sigma_{\mathrm{sys}} = \sigma^{\otimes n}$, we could not conclude that $T$ is strictly forgetful. In fact, there exist examples [MOR91] of nilpotent subspaces of matrices which are not simultaneously triangularizable. From such an example it is easy to construct a memory channel which is strictly forgetful for all $\sigma_{\mathrm{sys}} = \sigma^{\otimes n}$ but not for general system states. Indeed, let all systems $\mathcal{A}$, $\mathcal{B}$ and $\mathcal{M}$ be qubits and choose the Pauli matrices as operator basis. Consider the following matrices



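(60) $\tilde{X}_{1,2} = \begin{pmatrix} 0 & 0 & 0 \\ -a & 0 & 0 \\ 0 & a & 0 \end{pmatrix}, \quad \tilde{X}_{1,3} = \begin{pmatrix} 0 & b & 0 \\ 0 & 0 & b \\ 0 & 0 & 0 \end{pmatrix}, \quad \tilde{X}_{1,1} = \tilde{X}_{1,4} = 0,$

and assume all other matrix elements of $D_T$ to be zero, except $\langle 1,1 | D_T | 1,1 \rangle = 1$, which represents the trace-preserving property of $T$. Again, for small enough $a$ and $b$ this is completely positive. For a state $\sigma \in \mathcal{S}(\mathcal{A})$ we get the memory channel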



(61) $D_{T_M,\sigma} = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 0 & b\,a_y(\sigma) & 0 \\ 0 & -a\,a_x(\sigma) & 0 & b\,a_y(\sigma) \\ 0 & 0 & a\,a_x(\sigma) & 0 \end{pmatrix},$

where $a_x(\sigma)$ respectively $a_y(\sigma)$ denote the coefficients of $\sigma$ with respect to the Pauli operators $x$ respectively $y$. Obviously, $D_{T_M,\sigma}$ is a root of the bistochastic CDC of order at most three, but the family of all $D_{T_M,\sigma}$ is not simultaneously triangularizable. This is expressed by the fact that the sequence



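(62) $\bigl(D_{T_M,\psi_x} \cdot D_{T_M,\psi_y}\bigr)^n = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & (-ab)^n & 0 \\ 0 & 0 & 0 & (ab)^n \end{pmatrix},$

where $\psi_x$ respectively $\psi_y$ denote the eigenstates of the Pauli operators $x$ respectively $y$ with eigenvalue $+1$, never exactly equals the CDC, although it converges exponentially in $n$ towards the CDC.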
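The two claims of this example can be verified with exact arithmetic on the $3 \times 3$ block acting on the traceless operators. The sketch below is an illustration under our own assumptions: the block is taken as $\begin{pmatrix} 0 & b\,a_y & 0 \\ -a\,a_x & 0 & b\,a_y \\ 0 & a\,a_x & 0 \end{pmatrix}$, the values $a = 1/2$ and $b = 1/3$ are chosen arbitrarily without checking complete positivity, and $\psi_x$, $\psi_y$ are taken to have coefficients $(a_x, a_y) = (1, 0)$ and $(0, 1)$. The function names are hypothetical.

```python
from fractions import Fraction

def matmul(a, b):
    """Multiply two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][m] * b[m][j] for m in range(n)) for j in range(n)]
            for i in range(n)]

def power(mat, n):
    """n-th power of a square matrix for n >= 1."""
    result = mat
    for _ in range(n - 1):
        result = matmul(result, mat)
    return result

def block(ax, ay, a, b):
    """Traceless block of the memory-branch matrix for coefficients (ax, ay)."""
    return [[0, b * ay, 0],
            [-a * ax, 0, b * ay],
            [0, a * ax, 0]]

a, b = Fraction(1, 2), Fraction(1, 3)   # arbitrary illustrative values
zero = [[0] * 3 for _ in range(3)]

# Repeating the same system state: every block is nilpotent of order at most three.
samples = [(Fraction(1), Fraction(0)), (Fraction(0), Fraction(1)),
           (Fraction(3, 5), Fraction(4, 5)), (Fraction(-1, 2), Fraction(1, 3))]
single_state_ok = all(power(block(ax, ay, a, b), 3) == zero for ax, ay in samples)

# Alternating between psi_x and psi_y: the product is diagonal and never vanishes.
Nx = block(1, 0, a, b)
Ny = block(0, 1, a, b)
alternating = [power(matmul(Nx, Ny), n) for n in range(1, 6)]
never_zero = all(p != zero for p in alternating)
```

In this sketch `alternating[n-1]` equals $\mathrm{diag}(0, (-ab)^n, (ab)^n)$, which decays geometrically in $n$ but is non-zero for every finite $n$.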



Discussion 

Our construction implies that the maximal memory depth of the channel depends only on the dimension of the memory system. Under further assumptions the bound can sometimes be improved. For example, if the memory channel is assumed to be reversible, and hence given by a unitary operator, the memory branch is a homomorphism. Reversible qubit channels have been discussed in [RZ09], and the maximal memory depth was shown to be $2 < 3 = 2^2 - 1$. More generally, one can see that the nesting of linear subspaces implicit in the Jordan decomposition has to be replaced in the reversible case by a nesting of subalgebras. Since for these some dimensions are forbidden, one gets a tighter bound, namely depth $\le 2(d - 1)$ [GRW].